Co-Clustering Under the Maximum Norm

نویسندگان

  • Laurent Bulteau
  • Vincent Froese
  • Sepp Hartung
  • Rolf Niedermeier
چکیده

Co-clustering, that is, partitioning a numerical matrix into “homogeneous” submatrices, has many applications ranging from bioinformatics to election analysis. Many interesting variants of co-clustering are NP-hard. We focus on the basic variant of co-clustering where the homogeneity of a submatrix is defined in terms of minimizing the maximum distance between two entries. In this context, we spot several NP-hard as well as a number of relevant polynomial-time solvable special cases, thus charting the border of tractability for this challenging data clustering problem. For instance, we provide polynomial-time solvability when having to partition the rows and columns into two subsets each (meaning that one obtains four submatrices). When partitioning rows and columns into three subsets each, however, we encounter NP-hardness even for input matrices containing only values from {0, 1, 2}.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Fuzzy Inner Product and Fuzzy Norm \of Hyperspaces

We introduce and  study  fuzzy (co-)inner product and fuzzy(co-)norm of hyperspaces. In this regard by considering  the notionof hyperspaces, as a generalization of vector spaces, first we willintroduce the notion of fuzzy (co-)inner product in hyperspaces and will apply it to formulate the notions offuzzy (co-)norm and fuzzy (co-)orthogonality  in hyperspaces. Inparticular, we will prove that ...

متن کامل

Web - based Supplementary Materials for “ Comparing Large Co - variance Matrices under Weak Conditions on the Dependence Structure and its Application to Gene Clustering ” , by Jinyuan

First we introduce some of the basic notations. For any vector u = (u1, . . . , up) T ∈ R, denote by |u|q the vector `q-norm defined by |u|q = (∑p k=1 |uk| )1/q for q ≥ 1 and write |u|0 = ∑p k=1 I(uk 6= 0). For any set S, denote by S its complement. For a matrix A = (ak`) ∈ Rp×p, we denote by ‖A‖2 the spectral norm, ‖A‖F the Frobenius norm, and ‖A‖1 = ∑p k,`=1 |ak`| the elementwise `1-norm. Rec...

متن کامل

Impact of the Choice of Normalization Method on Molecular Cancer Class Discovery Using Nonnegative Matrix Factorization

Nonnegative Matrix Factorization (NMF) has proved to be an effective method for unsupervised clustering analysis of gene expression data. By the nonnegativity constraint, NMF provides a decomposition of the data matrix into two matrices that have been used for clustering analysis. However, the decomposition is not unique. This allows different clustering results to be obtained, resulting in dif...

متن کامل

Approximation Algorithms for Bregman Clustering Co-clustering and Tensor Clustering

The Euclidean K-means problem is fundamental to clustering and over the years it has been intensely investigated. More recently, generalizations such as Bregman k-means [8], co-clustering [10], and tensor (multi-way) clustering [40] have also gained prominence. A well-known computational difficulty encountered by these clustering problems is the NP-Hardness of the associated optimization task, ...

متن کامل

Clustering, Hamming Embedding, Generalized LSH and the Max Norm

We study the convex relaxation of clustering and hamming embedding, focusing on the asymmetric case (co-clustering and asymmetric hamming embedding), understanding their relationship to LSH as studied by [7] and to the max-norm ball, and the differences between their symmetric and asymmetric versions.

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2014